Search results for "pedestrian simulation"

showing 2 items of 2 documents

Using Inverse Reinforcement Learning with Real Trajectories to Get More Trustworthy Pedestrian Simulations

2020

Reinforcement learning is one of the most promising machine learning techniques to get intelligent behaviors for embodied agents in simulations. The output of the classic Temporal Difference family of Reinforcement Learning algorithms adopts the form of a value function expressed as a numeric table or a function approximator. The learned behavior is then derived using a greedy policy with respect to this value function. Nevertheless, sometimes the learned policy does not meet expectations, and the task of authoring is difficult and unsafe because the modification of one value or parameter in the learned value function has unpredictable consequences in the space of the policies it represents…

0209 industrial biotechnologyreinforcement learningComputer scienceGeneral Mathematics02 engineering and technologypedestrian simulationTask (project management)learning by demonstration020901 industrial engineering & automationAprenentatgeInformàticaBellman equation0202 electrical engineering electronic engineering information engineeringComputer Science (miscellaneous)Reinforcement learningEngineering (miscellaneous)business.industrycausal entropylcsh:MathematicsProcess (computing)020206 networking & telecommunicationsFunction (mathematics)inverse reinforcement learninglcsh:QA1-939Problem domainTable (database)Artificial intelligenceTemporal difference learningbusinessoptimizationMathematics
researchProduct

Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models

2017

This paper analyzes the emergent behaviors of pedestrian groups that learn through the multiagent reinforcement learning model developed in our group. Five scenarios studied in the pedestrian model literature, and with different levels of complexity, were simulated in order to analyze the robustness and the scalability of the model. Firstly, a reduced group of agents must learn by interaction with the environment in each scenario. In this phase, each agent learns its own kinematic controller, that will drive it at a simulation time. Secondly, the number of simulated agents is increased, in each scenario where agents have previously learnt, to test the appearance of emergent macroscopic beha…

Engineeringmedia_common.quotation_subject02 engineering and technologyPedestrianMachine learningcomputer.software_genreConsistency (database systems)Robustness (computer science)0202 electrical engineering electronic engineering information engineeringReinforcement learningQuality (business)Macromedia_commonInformáticaPedestrian simulation and modelingKinematic controllerbusiness.industry020207 software engineeringEmergent behavioursBehavioural simulationHardware and ArchitectureModeling and SimulationScalability020201 artificial intelligence & image processingArtificial intelligencebusinessMulti-agent reinforcement learning (Marl)computerSoftwareSimulation Modelling Practice and Theory
researchProduct